High Dimensional Data Differential Privacy Protection Publishing Method Based on Association Analysis

نویسندگان

چکیده

In order to solve the problem of privacy disclosure when publishing high-dimensional data and protect frequent itemsets in association rules, a method based on rules (PDP Growth) is proposed. This method, distributed framework, utilizes rough set theory improve mining rules. It optimizes analysis while reducing dimensionality data, eliminating more redundant attributes, obtaining concise itemsets, uses exponential mechanism differential simplest itemset obtained, effectively protects by adding Laplace noise its support. The validates that satisfies requirement protection. Experiments multiple datasets show this can efficiency meet Finally, results requirements are published.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Publishing Set-Valued Data via Differential Privacy

Set-valued data provides enormous opportunities for various data mining tasks. In this paper, we study the problem of publishing set-valued data for data mining tasks under the rigorous differential privacy model. All existing data publishing methods for set-valued data are based on partitionbased privacy models, for example k-anonymity, which are vulnerable to privacy attacks based on backgrou...

متن کامل

LoPub: High-Dimensional Crowdsourced Data Publication with Local Differential Privacy

Nowadays, with the development of various integrated sensors and crowd sensing systems, the crowdsourced information from all aspects can be collected and analyzed among various data attributes to better produce rich knowledge about the group [19], [47], thus benefiting everyone in the crowdsourced system [24]. Particularly, with high-dimensional crowdsourced data (data with multiple attributes...

متن کامل

Privacy-preserving data publishing for cluster analysis

Releasing person-specific data could potentially reveal sensitive information about individuals. k-anonymization is a promising privacy protection mechanism in data publishing. Although substantial research has been conducted on k-anonymization and its extensions in recent years, only a few prior works have considered releasing data for some specific purpose of data analysis. This paper present...

متن کامل

Privacy - Preserving Data Publishing

The success of data mining relies on the availability of high quality data. To ensure quality data mining, effective information sharing between organizations becomes a vital requirement in today's society. Since data mining often involves person-specific and sensitive information like medical records, the public has expressed a deep concern about their privacy. Privacy-preserving data publishi...

متن کامل

Methods for regression analysis in high-dimensional data

By evolving science, knowledge and technology, new and precise methods for measuring, collecting and recording information have been innovated, which have resulted in the appearance and development of high-dimensional data. The high-dimensional data set, i.e., a data set in which the number of explanatory variables is much larger than the number of observations, cannot be easily analyzed by ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Electronics

سال: 2023

ISSN: ['2079-9292']

DOI: https://doi.org/10.3390/electronics12132779